
We thank the reviewers for their detailed comments and feedback. [14] makes two contributions. One of them is the nWE baseline, which our approach outperforms; that baseline, however, yields barely any improvement. Moreover, our approach additionally exploits image similarities.